Model Selection

Large model inference optimization

# Large model inference optimization

Llama 4 Scout 17b 16e It Gguf

An image-text to text conversion model built on the Meta Llama base model, supporting interaction through gguf-connector and llama-cpp-python.

Llama 3.1 70B Instruct GGUF

An ultra-low-bit (1-2 bit) quantized model based on Llama-3.1-70B, utilizing IQ-DynamicGate technology for adaptive precision quantization, enhancing accuracy while maintaining memory efficiency.

Large Language Model Supports Multiple Languages

Featherless Ai.qwerky QwQ 32B GGUF

Qwerky-QwQ-32B is a large language model with 32B parameters, specializing in text generation tasks.

Large Language Model

Sky T1 32B Preview GGUF

Sky-T1-32B-Preview is a 32B-parameter large language model, quantized using llama.cpp's imatrix, suitable for text generation tasks.

Large Language Model English

Mixtral 8x22B V0.1 GGUF

Quantized version of Mixtral-8x22B-v0.1, using llama.cpp for quantization, supporting multiple languages and quantization types.

Large Language Model Supports Multiple Languages

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase